OPEN 3 ACCESS Freely available online 



•0-PLOS I ONE 



/?£rVariants and Haplotype Analysis in a Cohort of Czech (S\ 
Patients with Hirschsprung Disease 

Eliska Vaclavikova 1 *, Sarka Dvorakova 1 , Richard Skaba 2 , Lucie Pos 2 , Vlasta Sykorova 1 , Tereza Halkova 1 , 
Josef Vcelak 1 , Bela Bendlova 1 

1 Department of Molecular Endocrinology, Institute of Endocrinology, Prague, Czech Republic, 2 Department of Paediatric Surgery, 2nd Faculty of Medicine, Charles 
University and Hospital Motol, Prague, Czech Republic 



Abstract 

Hirschsprung disease (HSCR) is a congenital aganglionosis of myenteric and submucosal plexuses in variable length of the 
intestine. This study investigated the influence and a possible modifying function of fi£T proto-oncogene's single nucleotide 
polymorphisms (SNPs) and haplotypes in the development and phenotype of the disease in Czech patients. Genotyping of 
14 SNPs was performed using TaqMan Genotyping Assays and direct sequencing. The frequencies of SNPs and generated 
haplotypes were statistically evaluated using chi-square test and the association with the risk of HSCR was estimated by 
odds ratio. SNP analysis revealed significant differences in frequencies of 11 polymorphic RET variants between 162 HSCR 
patients and 205 unaffected controls. Particularly variant alleles of rsl 864410, rs2435357, rs2506004 (intron 1), rsl 800858 
(exon 2), rsl 800861 (exon 13), and rs2565200 (intron 19) were strongly associated with increased risk of HSCR (p<0.00000) 
and were over-represented in males vs. females. Conversely, variant alleles of rsl 800860, rsl 799939 and rsl 800863 (exons 7, 
11, 15) had a protective role. The haploblock comprising variants in intron 1 and exon 2 was constructed. It represented a 
high risk of HSCR, however, the influence of other variants was also found after pruning from effect of this haploblock. 
Clustering patients according to genotype status in haploblock revealed a strong co-segregation with several SNPs and 
pointed out the differences between long and short form of HSCR. This study involved a large number of SNPs along the 
entire RET proto-oncogene with demonstration of their risk/protective role also in haplotype and diplotype analysis in the 
Czech population. The influence of some variant alleles on the aggressiveness of the disease and their role in gender 
manifestation differences was found. These data contribute to worldwide knowledge of the genetics of HSCR. 
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Introduction 

Hirschsprung disease (HSCR) is a congenital developmental 
malformation characterised by the absence of enteric ganglion 
cells of myenteric and submucosal plexuses in the intestine. The 
incidence of the disease is 1 per 5000 live births. Short-segment 
aganglionosis (80%) and long-segment aganglionosis (20%) are 
classified according to the length of the aganglionic segment. The 
short-segment form of HSCR (S-HSCR), comprising recto 
sigmoid HSCR and ultra-short segment HSCR, affects the distal 
portion, part of the anal canal and, in contrast to long-segment 
HSCR, occurs four times more often in males than females. The 
long-segment form of HSCR (L-HSCR) can present as total 
colonic aganglionosis (TCA) extending from the rectum up to the 
terminal ileum and, in rare cases, as nearly total bowel (NTBA) or 
total intestinal aganglionosis (TIA) comprising nearly total or the 
whole intestine. It occurs as an isolated disorder in 70% of cases, 
about 12% of cases have a chromosomal abnormality and 18% of 
cases have additional congenital anomalies [1]. 

Isolated HSCR appears as a sporadic (85%) or less commonly as 
a familial disorder (15%). To date, more than 14 genes and 5 



susceptibility loci have been associated with the disease [1,2]. 
However, the inactivating germline mutations in the RET proto- 
oncogene play the major role in the pathogenesis. The RET 
(Rearranged during Transfection), a gene consisting of 21 exons 
and located on chromosome 10qll.2, encodes a transmembrane 
tyrosine kinase receptor. It is expressed in neural crest-derived cells 
and plays an important role in the development, proliferation and 
differentiation of neuroendocrine cells. The various alterations 
(missense mutations, deletions, insertions, frame shifts) have been 
found along the entire gene. Inactivating germline AET mutations 
are detected in about 50% of familial and 15-20% of sporadic 
HSCR cases [1]. 

Although the detection rate of germline RET mutations is 
relatively low, linkage analysis has shown that nearly all familial 
HSCR patients are in linkage with the RET proto-oncogene [3]. 
Therefore, noncoding variants and single nucleotide polymor- 
phisms (SNPs) as well as some specific haplotypes of the RET 
proto-oncogene have been revealed to be a potential low 
susceptibility loci and have a function of genetic modifying factors 
in HSCR pathogenesis. Several RET polymorphisms have been 
investigated in the association with HSCR. Previous studies [3-5] 
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described the variants in the coding regions - polymorphism in 
exon 2 (rs 1800858, p.Ala45Ala, C.135G/A) was overrepresented in 
HSCR patients with respect to controls, as well as polymorphisms 
in exon 7 (rs 1800860, p.Ala432Ala, C.1296G/A) and in exon 13 
(rs 1800861, p.Leu769Leu, C.2307T/G) are risk factors for HSCR. 
On the other hand, SNPs in exons 11 (rsl 799939, p.Gly691Ser, 
C.2071G/A), 14 (rsl800862, p.Ser836Ser, C.2508C/T) and 15 
(rsl800863, p.Ser904Ser, C.2712C/G) were underrepresented in 
HSCR cases, suggesting a protective role of these SNPs. However, 
results could differ depending on methods, cohorts and frequencies 
of SNPs. 

Consequendy, the study of SNPs has been focused on 
noncoding regions - 5 'untranslated region (UTR), intron variants 
and 3' UTR. The investigation of the strong association of 
rsl 800858 (p.Ala45Ala, C.135G/A) with HSCR was extended 
upstream of exon 2 to intron 1 and promoter. A haplotype 
spanning 27 kb of the 5 'UTR region to exon 2 was determined as 
HSCR-associated [2,6-8]. The investigation of 3'UTR variants 
underrepresented in HSCR patients resulted in the suggestion of 
the protective haplotype in 3'UTR [9-10]. 

In this large case-control study, we investigated the possible role 
of variants in the RET proto-oncogene in cohorts of 162 Czech 
patients with HSCR and 205 unaffected control individuals and 
the role of these variants in gender manifestation differences and 
their influence on the aggressiveness of the disease. We carried out 
a screening of 14 polymorphisms throughout the RET proto- 
oncogene to identify risk or protective SNPs and haplotypes 
associated with the HSCR phenotype from the Czech population. 

Material and Methods 

In this study, blood samples were obtained from 162 Czech 
patients with HSCR (121 males and 41 females). The patients 
were chosen for molecular genetic analysis prospectively (88 cases 
operated on from 2003 till now) as well as retrospectively (74 cases 
operated on between 1979 and 2003). According to the length of 
the aganglionic segment, the cohort consisted of 1 1 7 patients with 
short-segment HSCR (recto-sigmoid form; patients with ultra- 
short segment were not involved in the study), 41 patients with 
long-segment aganglionosis (including 20 patients with total 
colonic aganglionosis and 2 patients with nearly total bowel 
aganglionosis), and 4 patients with an unspecified form of HSCR. 
The control group included 205 Czech healthy individuals (95 
males, 110 females). 

Ethics statement 

This study was approved by the Ethical Committee of the 
Institute of Endocrinology and University Hospital Motol, Prague. 
A signed informed consent for this study was obtained from each 
patient or legitimate representative who participated. 

Genetic analysis 

Genomic DNA was isolated from peripheral blood leukocytes 
using the QIAamp DNA Blood Kit (Qiagen, Germany) and the 
QuickGene 610 L machine (Fujifilm, Japan). Studied polymor- 
phisms are listed in Table 1 . Five of SNPs were analysed during 
routine sequencing testing of HSCR patients for mutations in 
exons 10, 11, 13, 14, 15 and 16 in the AETproto-oncogene due to 
the risk of medullary thyroid cancer. The standard conditions for 
amplification of RET exons were provided previously [11] as well 
as sequenase reaction preparation [12]. Direct sequencing was 
performed on the CEQ_ 8000 sequencing machine (Beckman 
Coulter) and analysed by CEQ, 8000 sequencing software. 
Sequence traces were compared with RET reference sequence 



(NG_007489.1). Genotyping of the other 9 SNPs in patients and 
all 14 SNPs in control samples were determined by real-time 
polymerase chain reaction (PCR) with TaqMan Genotyping 
Assays (Applied Biosystems) on LightCycler 480 Real-Time PCR 
System (Roche) with using non template control in each run. 

Statistical analysis 

The frequencies of studied SNPs were statistically evaluated and 
compared using the NCSS programme and chi-square test with 
establishing p-value. The results were considered as statistically 
significant if the p-value was less than 0.05. Association with risk of 
HSCR was estimated by odds ratio (OR) and their 95% 
confidence interval (CI). For generating haplotypes, the Haploview 
programme (version 4.1) was used and a haplotype block of 
polymorphisms which were in linkage disequilibrium was 
constructed using Gabriel's methods. Consequently, haplotypes 
of the particular haploblock were generated in each individual 
patient and control and diplotypes were estimated using the 
PHASE programme (version 2.1). 

Results 

SNP analysis 

Statistical evaluation of studied SNPs revealed significant 
differences in genotype and allele distribution of polymorphic 
variants between HSCR patients and normal controls (Table 1). 
No significant deviation from Hardy-Weinberg equilibrium was 
found in the control cohort. Studied cohorts significantly differed 
in 1 1 out of 14 investigated SNPs. In SNPs rsl864410, rs2435357, 
rs2506004, rsl 800858 the variant allele dominated in HSCR 
patients (all p<0. 00000) and their carriership was associated with 
more than 6.6-fold elevated risk for development of HSCR 
compared with the wild-type allele carriership. These 4 SNPs were 
in complete linkage disequilibrium and went together with almost 
the same genotype distribution in all cohorts. The variant allele 
was also over-represented in rsl800861 (p<0.00000, OR = 2.76, 
95% CI = 2.00-3.81) and rs2565200 (p<0.00000, OR = 3.01, 
95% CI — 2.16-4.20). In contrast, the variant allele was under- 
represented in two SNPs in linkage disequilibrium: rsl 799939 (p< 
0.00000, OR = 0.31, 95% CI = 0.19-0.48) and rsl800863 (p< 
0.00000, OR = 0.32, 95% CI = 0.20-0.50). Significant differences 
in allele distribution were established in rsl 800860, rsl 43948954 
and rs2435355. Allele distributions of SNPs considering the 
phenotype of the disease (L- and S-HSCR) and gender were 
described in Table SI and Table S2. 

Three SNPs (rsl 1 1264957, rsl800862, rsl43948954) were 
detected in very low frequencies of variant allele (2.9%, 2.9%, 
and 0.5%, respectively in controls). 

Haplotype and diplotype analysis 

Using the Haploview programme for generating haplotypes, 
two haplotype blocks were identified (Figure 1) comprising 10 out 
of 14 investigated polymorphisms that were in linkage disequilib- 
rium. The first haploblock was composed of 4 SNPs (rsl 8644 10, 
rs2435357, rs2506004, rsl 800858) that were very closely related 
together. The second haploblock consisting of 6 SNPs (rsl 799939, 
rsl800861, rs2472737, rsl800863, rs2565200, rs2435355) was 
very heterogeneous and differed in distribution between particular 
cohorts. Therefore, these SNPs were rather evaluated separately. 
Moreover, 4 SNPs not included in any of two haploblocks due to 
low frequencies in cohorts or its localization in the gene were also 
investigated separately. 

The two most frequent haplotypes of 5' region haploblock were 
represented by TTAA and GCCG (Table 2). The presence of only 
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Figure 1. Haplotype blocks generated by the Haploview Programme in cohorts of HSCR patients and control population. 

scheme is shown with confidence bounds. LD values are reported in D'. 
doi:1 0.1 371 /journal.pone.0098957.g001 



The 



two main haplotypes was caused by above-mentioned trend that in 
nearly all cases these 4 SNPs went together in wild type or variant 
allele mode. The distribution of haploblock-haplotypes signifi- 
candy differed between HSCR patients and controls (p<0. 00000). 
Although the most frequent haplotype in HSCR patients was 
TTAA (HSCR 70.7% vs. controls 26.1%), the most frequent in 
controls was haplotype GCCG (HSCR 27.5% vs. controls 71.2%). 
The haplotype TTAA had a 6.83-fold (95% CI = 4.93-9.45, p< 
0.00000) elevated risk for development of HSCR than other 
haplotypes. The different distribution of estimated combinations of 
haplotypes in two alleles of each subject resulted from this 
converse representation of haplotypes. The diplotype TTAA,T- 
TAA had a 17,56-fold (95% CI = 9.25-33.35, p<0.000()0) 
elevated risk for the development of HSCR than other diplotypes. 

If we consider the phenotype of the disease with haploblock 
status, variant allele frequencies in L-HSCR and S-HSCR were 
significandy different from controls, but did not vary between 
forms of HSCR (Table 3). The risk of HSCR was higher in the S- 
form than in L-form resulting from odds ratios (7.62 vs. 5.66 for 



rs 18644 10). By comparing frequencies of SNPs in haploblock 
between male patients and male controls as well as female patients 
and female controls, similar significant results were obtained- 
contrary representation of the variant allele in controls vs. patients. 
In comparison between male and female patients, there were also 
statistically significant differences. The different allele distribution 
in haploblock (for rsl864410 variant allele in male 76.0% vs. 
female 59.8%) was related to the different risk for development of 
HSCR in males (OR = 8.20, 95% CI = 5.32-12.65) and females 
(OR =3.96, 95% CI = 2.33-6.74). 

Distribution of 3' region SNPs with respect to 5' region 
haploblock status 

Other SNPs located in 3' RET region were examined 
depending on what haplotype in 5' part of gene was carried. 
Given allelic distribution of TTAA and GCCG haplotypes 
(Table 4), frequencies of variant alleles of rs 1800861 and 
2565200 as well as rs 1799939 and rs 1800863 significandy differed 
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Table 2. Distribution of haplotypes and diplotypes of 5' region haploblock in HSCR patients and controls. 





Haplotypes of haploblock* 


HSCR (%) 


Controls (%) 


p-value (for x 2 ) 


OR (95% CI) 


p-value (for OR) 


1 TTAA 


229 (70.7) 


107 (26.1) 


0.00000 


6.83 (4.93-9.45) 


0.00000 


2 GCCG 


89 (27.5) 


292 (71.2) 


0.00000 


0.15 (0.11-0.21) 


0.00000 


Diplotypes of haploblock* 


HSCR (%) 


Controls (%) 


p-value (for x 2 ) 


OR (95% CI) 


p-value (for OR) 


1,1 TTAA,TTAA 


88 (54.3) 


13 (6.3) 


0.00000 


17.56 (9.25-33.35) 


0.00000 


1,2 TTAA,GCCG 


50 (30.9) 


75 (36.6) 


0.25081 


0.77 (0.50-1 .20) 


0.29951 


2,2 GCCG,GCCG 


18 (11.1) 


106 (51.7) 


0.00000 


0.12 (0.07-0.20) 


0.00000 



* Haplotypes/diplotypes with occurrence <2% in both cohorts are not included. 
doi:1 0.1 371 /journal.pone.0098957.t002 



between cases and controls with the same risk/ protective trends as 
were shown in previous SNP analysis (Table 1). In genotype 
distribution of these haplotypes, there was possible to observe the 
tendency of the increasing/ decreasing variant allele frequencies 
depending on carriership of TTAA in two, one or no alleles (Table 
S3). 

In carriers of TTAA haplotype, differences between L-HSCR 
and S-HSCR phenotype were intensified in variant alleles of 
rsl800861, rs2472737, rs2565200 and rs2435355, whereas in 
carriers of GCCG no significant differences between phenotypes 
were found (Table 5). In TTAA carriers, the variant alleles of 
rs 1800861 and rs2565200 were over-represented in S-HSCR 
(55.6% and 54.1%, respectively) vs. L-HSCR (37.7% and 36.8%, 
respectively). The variant alleles of rs2472737 and rs2435355 were 
over-represented in L-HSCR (39.6% and 44.2%, respectively) vs. 
S-HSCR (25.0% and 32.9%, respectively). Genotype distribution 
of haploblock-haplotypes TTAA and GCCG describing differenc- 
es in 3' region SNPs frequencies between L- and S-HSCR is 
shown in Table S4. No differences between male and female 
patients and controls were observed considering the 5' region 
haplotype (Table S5). 

Discussion 

The genetics of HSCR is complex. It is believed that it can be 
inherited in a dominant or recessive trait, but probably it is 
polygenic with incomplete penetrance, genetic heterogeneity, 
variable expression of the disease and a large number of additive 
syndromes. The main impact of the RET proto-oncogene 
mutation is a loss of function (haploinsufficiency). Almost all 
HSCR cases are linked to the RET locus despite no detected 
mutation. Therefore, noncoding RET variants and SNPs must 
play at least a modifying role in the remaining HSCR cases. 

We focused our study on SNPs along the entire RET proto- 
oncogene and selected 14 promising SNPs in coding and intronic 
sequences and 3'UTR. Allele frequencies in our tested normal 
population were similar with frequencies in previously reported in 
a European control population [13] and gready varied from an 
Asian control population [2,10,14,15]. 

Our data show that mainly 8 SNPs are very important in the 
development of HSCR. We found a strong association of variant 
alleles of 4 investigated SNPs in intron 1 and exon 2 grouped in 
haploblock (rsl864410, rs2435357, rs2506004, rsl800858) with 
HSCR (p<0. 00000). These alleles were the most frequent in our 
patient cohort and formed the haplotype TTAA with a very 
elevated risk (OR= 6.83). In the homozygous diplotype TTAA , T- 
TAA the risk was even more increased (OR =17.56). These 
findings are consistent with other studies describing these SNPs as 



belonging to HSCR-causing region covering 27 kb in total. It is a 
highly conserved region called MSC+9.7 (Multi species conserved) 
and starts 4 kb upstream of the RET transcription start site and 
going along the way to the beginning of exon 2. This risk 
haplotype decreases RET promoter activity, reduces the binding 
affinity of TTF-1 (thyroid transcription factor 1), disrupts a 
binding site for transcription factor SOX 10, decreases enhancer 
for RET expression and thus regulates RET expression which was 
confirmed by in vitro studies [7,8,16,17]. 

In previous studies, there was suggested the hypothesis about 
the different role of SNPs in two linkage disequilibrium regions. 
Besides the risk haplotype at the 5' end of the RET proto- 
oncogene, a protective function of the 3' half of the gene was 
proposed. SNPs in this region encompassing rsl 799939 up to 
3'UTR variants were reported as underrepresented in HSCR 
patients [3,9,10]. We cannot confirm this hypothesis because we 
detected several SNPs with increased risk in this region. The main 
contradiction with the hypothesis of the protective 3' half RET 
gene haplotype is the high overrepresentation of rs2565200 
located in intron 19 and rs2435355 located in 3'UTR. SNP 
rsl 800861 in exon 13 was also strongly associated with HSCR as 
previously described [3,15]. The risk effect of rsl 800861 and 
rs2565200 in Czech patients was confirmed. Moreover, after 
pruning from the effect of 5' region haploblock, variant allele 
frequencies of rsl800861 and rs2565200 still significantly differed 
in comparison of patients and controls and were associated with 
the risk of HSCR. In rs2435355, a divergent tendency was 
observed in Czech patients that differed from a study of Chinese 
patients [10] which instead had a protective role. Perhaps the 
discrepancy in the theory of protective 3' half gene role can be due 
to various studied cohorts and ethnic differences. We defined three 
main protective SNPs in our series located in exons 7 (rsl 800860), 
11 (rsl 799939) and 15 (rsl 800863). The protective role of 
rsl 799939 and rsl 800863 was described previously [3]. Consid- 
ering the 5 ' region haploblock status in investigated carriers, only 
rsl799939 with rsl800863 variant alleles stayed protective. In 
contrast to our results, some studies detected rsl 800862 variant 
allele underrepresented in HSCR with a protective role [18], and 
in the Chinese population the variant allele was even absent [14], 
which was correlated with increased HSCR incidence in the 
Chinese population. We identified this SNP in similar low 
frequencies in both our case and control cohorts, which was not 
in agreement with these findings. It seems that the other SNPs 
studied do not have any major role in the development of HSCR 
in our series. The low frequencies of variant allele were also 
identified in rsl 1 1264957 that was in linkage disequilibrium with 
rsl 800862, and rsl 43948954, where a significandy higher 
representation of the variant allele in patients was noticed. 
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However, due to very low frequency, this significance on the 
phenotype could not be definitively revealed. 

In our study, we also focused on the evaluation of the possible 
influence of the variant alleles on aggressiveness of the disease (L- 
HSCR vs. S-HSCR) and their role in gender manifestation 
differences. The risk of variant alleles in the 5' region haploblock 
was highly elevated both in L-HSCR and S-HSCR patients in 
relation to controls. Interestingly, the frequency of these risk alleles 
was even 6% higher in the less aggressive S-HSCR form. Similar 
results were described in rs2435357 [8] and rsl800858 [5], 
whereas no difference was revealed between L-HSCR and S- 
HSCR by Lantieri et al [3]. However, in our cohort, the higher 
frequency of variant alleles in S-HSCR was likely caused by a 
higher representation of male patients in S-HSCR (78%) than in 
the L-HSCR cohort (71%). A gender effect was previously 
described in rs2435357 where the variant allele was present in 
65% of males vs. 56% of females [19]. Our data documented this 
trend not only in SNPs of the haploblock (76% of males vs. 60% of 
females) but also in risk variant alleles of rs 1800861 and 
rs2565200. The frequency of these two risk alleles was about 
10% higher in males, but nonsignificant. In variants of the 
haploblock, the associated risk was much more profound (OR = 8 
in males vs. OR = 4 in females). It is not clear if over- 
representation of rs 1800861 and rs2565200 in S-HSCR patients 
and a significant difference related to L-HSCR was also influenced 
by sex differences between cohorts. The risk of L-HSCR was 
influenced especially by rs2472737 and rs2435355 where the 
variant allele elevated the risk of L-HSCR nearly two fold 
compared to S-HSCR. Both these findings-rs 1 80086 1 and 
rs2565200 variant allele over-representation in S-HSCR and 
rs2472737 and rs2435355 variant allele over-representation in L- 
HSCR-were even confirmed considering the 5' region haploblock 
status, but only in TTAA carriers. 

Conclusions 

This study aimed to identify the risk and protective SNPs and 
haplotypes that could be associated with Hirschsprung disease. 
From our results, we propose several risk and several protective 
SNPs and haplotypes. The influence of some variant alleles on the 
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